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Abstract: The field of biomarker research has almost reached unmanageable proportions 
in chronic obstructive pulmonary disease (COPD). The developments of new technology 
platforms have generated a huge information data base, both cross sectionally and increasingly, 
longitudinally. The knowledge emerging provides an enormous potential for understanding 
the disease pathophysiology, for developing markers specific for long-term outcomes, and 
for developing new therapeutic strategies. However, the excitement must be tempered with an 
understanding of the limitations of the data collection techniques, and of the variations in disease 
state, activity, impact, and progression. Nevertheless, the most crucial aspect in interpreting 
the current literature is the recognition of the relatively superficial characterization of what 
is a complex group of pathological processes with a common end point of airflow limitation. 
The current review explores some of these issues together with those areas where real progress 
appears to have been made, and provides caution on interpretation. 
Keyword: emphysema, inflammation, secretions, technology platforms 

Introduction 

"Biomarkers" have become a hot topic in the study and treatment of chronic obstructive 
pulmonary disease (COPD). In simple terms, a biomarker is a measurable characteristic 
that reflects the presence, severity, or state of a disease. For instance, whilst spiro- 
metry reflects the presence and dyspnea the severity or impact of COPD, temperature 
may reflect an infectious state. Furthermore, a change in the expression of a biomarker 
may reflect progression, the risk of progression, or the response to treatment, whilst 
genes and their products reflect the likelihood of disease emergence. Thus biomark- 
ers can be specific cells, molecules, genes, gene products, organ functions, or general 
clinical characteristics that reflect the damage done, or the process that the damage 
sets in motion or the process that leads to the damage, or can indicate the prognosis 
and/or response to therapy. More recently, the use of general technology platforms 
has led to an exponential increase in information, accompanied by uncertainty as to 
whether this reflects cause or effect. 

The current article attempts to review (at least in part) this enormous field together 
with some guidelines, based on potential pitfalls and on what we need to know. In 
general the "hot" questions we need answers to are: 

• Can we identify the susceptible individual before the disease state develops? 

• Can we identify "early" disease whilst it is still relatively asymptomatic? 

• Can we predict the rate of progression of the disease? 
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• Can we identify or confirm the key pathophysiological 
processes, leading to new drug discovery? 

• Can we identify factors that predict the response to cur- 
rent treatment or short Phase II clinical trials that provide 
confidence to undertake more extensive and definitive 
Phase III studies? 

• Can we be sure of the impact of our new strategies on 
the patient, in the present or future? 

Defining COPD status 

COPD is a slowly progressing disease, although the rate of 
progression varies. This is an important concept as it impacts 
upon three features of COPD, namely, the disease activity, 
its severity, and the impact on the patient. The activity of the 
disease process will influence the current or future severity in 
a time dependent manner, ie, a highly active disease process 
will produce more severe disease at an earlier age. Similarly, 
the severity of the disease will have a variable impact on the 
patient, dependent upon the rate at which it developed. Small 
changes that develop rapidly are more likely to have a greater 
impact on the younger patient than the same change developing 
slowly will have on a more elderly patient (see Figure 1). 

In addition, it is increasingly recognized that COPD, 
although defined by major changes in spirometry, has several 
distinct phenotypes (clinically and pathologically) as well as 
additional features, such as the exacerbation complex (also 
consisting of many pathophysiological processes) and the 
recognized comorbidities, which may have some common 
physiological yet distinct pathological mechanisms. 1 "Omics" 
is a neologism that aims at the collective characterization and 



quantification of pools of biological molecules that translate 
into the structure, function, and dynamics of an organism. 
Trying to understand the complexities of the enormous data 
generated by the "omics" platforms in COPD (proteomics, 
genomics, metabolomics, transcriptomics etc) is almost 
impossible without a similar high-quality and detailed patient 
characterization, and all biomarker studies need to be inter- 
preted in this light. 

Clinical biomarkers 

Chest hyperinflation, low body mass index (BMI), the use 
of accessory muscles of respiration, and prolonged expira- 
tion have always been the physical markers of the presence 
of airflow obstruction, especially that occurring with a 
predominant emphysema phenotype. However the objective 
measures of lung physiology have been the "gold standard" 
biomarkers to date. In particular, not only has spirometry 
been the lynchpin of diagnosis but forced expired volume 
in 1 second (FEVj) has been central to our description of 
severity. It is a marker of disease progression and response 
to therapy, as well as a sound predictor of mortality 2 and a 
weak indicator of future exacerbations. 3 Nevertheless, it only 
measures one aspect of COPD and relates poorly to the pres- 
ence of emphysema 4 or to patient-reported quality of life, 5 
and its deterioration is not linear with time. 6 

However spirometry remains central to our thinking of 
COPD, requires little equipment, is relatively simple to per- 
form and interpret (with appropriate training), and hence is a 
useful tool to detect "early" disease (at least when sufficiently 
advanced to fulfill the cross-sectional diagnostic criterion). 
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Figure I Line graph representation of disease progression in 3 idealized patients. Patient (Pt) A has slow disease progression and with age may notice or report few 
symptoms (ie, low impact/low disease activity/mild disease). Patient B has greater progression and symptoms become noticeable in the 60s (moderate impact/moderate 
disease activity/moderate disease). Patient C has rapid disease progression leading to symptoms in the 30s (major impact/high disease activity/mild disease). 
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Yet, even the term "early disease" is interpreted in different 
ways. To a patient, this may be understood as the stage in the 
disease when the symptoms or their impact is mild, whereas to 
physiologists, it is understood to occur when the spirometry is 
abnormal and yet is relatively close to the normal range. This 
concept is summarized in Figure 1 , showing that the speed with 
which airflow obstruction develops will influence the symp- 
toms perceived by the patient, which will clearly be dependent 
upon the activity of their lifestyle and expectations. 

Furthermore, the presence of comorbidity and other 
physiological impairment (gas trapping and defects of gas 
transfer) will also influence the overall impact on the patient. 
Indeed, the impact, as determined by health status tools, is 
also a fair guide to long-term mortality. 7 However, the poor 
correlation of health status with physiology suggests such 
tools only partly measure the same thing. Mortality is related 
to many factors, including the degree of breathlessness, 8 
exercise limitation, 9 and low BMI. 10 These markers predict 
the same end point but do not necessarily reflect the same 
thing. For this reason, composite scores, such as the BODE 
index (Body mass index, airflow Obstruction, Dyspnoea and 
Exercise), may be better predictors of long-term mortality. 11 
Because these relatively insensitive tests have been studied 
for many years, their use in assessing patients' progno- 
sis, progression, and response to treatment has been well 
documented. However, although useful as cross-sectional 
measures of cohort risks, these composite markers are less 
robust in assessing individual patients, 12 and a progression 
of health status can occur independent of spirometry, even 



though progression of both also predict mortality 12 However, 
the composite markers do fulfill two other criteria, namely, 
that changes, both by decline or improvement, also relate to 
mortality, 13 although may not be predictive of progression 
per se. 

Patient-reported outcomes 

Symptoms are included in many patient-reported outcomes 
and can be scored, in their own right, to provide both an indi- 
cator of the disease impact in individual patients but also, of 
progression and, importantly, the response to treatment. This 
alone consists of a large literature of assessment and valida- 
tion. A good start for the interested reader is the validation 
of the COPD Assessment Test and its comparison with other 
patient-reported outcomes (which again highlights the weak 
relationship to spirometry). 14 Nevertheless, these subjective 
tests do act as a biomarker of the impact of the disease on 
the patient, even if this is not entirely (or even only partly) 
explained by the current objective measures (a disjunction 
between disease severity and impact). 

Radiology 

More recently, the widespread use of high-resolution com- 
puted tomography (CT) scans and other radiological tech- 
niques has been applied to COPD populations. The CT scan is 
now the marker of choice to identify and quantify the presence 
and type of emphysema. This patient phenotype also relates 
to health status (though also weakly) as shown in Figure 2. 
Emphysema is a good predictor of mortality, 1516 can be present 
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Figure 2 Relationship between health status recorded as the SGRQ score and decreasing lung density as a marker of emphysema. Individual patient data points are shown 
and the significance of the correlation is given. 
Abbreviation: CT, computed tomography. 
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with normal spirometry, 17 and is the outcome of choice in 
treatments aimed at improving 18 or slowing down the progres- 
sion 15 of emphysema. CT scanning is recognized as the most 
sensitive test of progression in patients with emphysema, 20 
being significantly more sensitive than spirometry and even 
gas transfer, in such patients. More recently, further refine- 
ments of CT analysis have enabled the relationship between 
small airways disease and emphysema to be characterized 
in more detail, 21 and the preliminary data suggest that small 
airways disease may precede the development of radiological 
emphysema. The implications of this for our understanding 
of pathophysiology clearly requires further study. 

Radiology can also study the airways, indicating responses 
to bronchodilator therapy 22 and the presence and distribution 
of accompanying bronchiectasis, which is also a predictor of 
mortality 23 (although the reasons are currently unclear). 

Chronic bronchitis 

The chronic bronchitis phenotype received little attention as 
a marker until recently and is now recognized as a predictor 
of disease progression 24 and recurrent exacerbations. 25 The 
latter defines a further clinical phenotype (and hence marker) 
likely to respond to anti-inflammatory therapy, both by the 
inhaled route 26 and oral administration. 27 

Despite the well-tried and accepted roles of these 
"clinical" biomarkers, in more recent research, the term has 
come to mean a fluid sample test that is specific, particularly 
in terms of the pathophysiological processes taking place. 
Such research was initially based on a hypothesis-testing 
strategy, but the development and ready availability of tech- 
nology platforms has caused this to become a data-collecting 
and possibly, a hypothesis-generating strategy. The latter 
relies on extensive data collection, which is then probed for 
patterns (generally the "omics" strategy) that may become 
informative about the potential process(es) taking place. 

Biomarkers based 
on pathophysiology 

The generally accepted concept is that the development of 
COPD occurs in susceptible individuals as a result of an 
enhanced inflammatory response to inhaled substances, like 
cigarette smoke. However, not only is COPD a generically 
denned syndrome but, inflammation itself is a highly complex, 
interrelated network of cells, cytokines, and other proteins 
that both amplify and dampen down the process - in part, this 
represents a balance between the destructive elements and 
the reparative ones. However, once damage has occurred to 
the delicate structures of the lung, repair cannot completely 



restore function, especially in the alveolar regions. This has 
been best demonstrated in animal models of emphysema, 
where a single challenge has been shown to result in the loss of 
lung elastin and although it reaccumulates, the development 
of emphysema. 28 To date, the majority of biomarker studies 
in COPD have concentrated on the damage process. 

Our understanding of the mechanisms involved in the 
pathophysiology of COPD relates back to the single obser- 
vation that alpha- 1 -antitrypsin (AAT)-deficiency (AATD) 
resulted in an increased susceptibility to the development of 
early-onset basal emphysema. 29 The cascade of events that 
followed this observation resulted in many studies of AAT 
phenotypes in COPD, but apart from the recognition of the Pi 
ZZ severe form of deficiency and some very rare null genes, 
little was found. However, even the deficiency itself is not a 
highly sensitive marker since many affected patients remain 
healthy and with normal lung function. The true incidence of 
the deficiency and disease and hence, the sensitivity of even 
this genetic/biochemical marker remains unknown. 

The mechanism involved became relatively easy to define. 
AAT is a specific inhibitor of serine proteinases, and elastase 30 
and proteinase 3, 31 released by activated neutrophils, have 
both been shown to produce emphysematous lesions in 
experimental animals. Since neutrophil recruitment to the 
lung results in the release of these enzymes, elastin diges- 
tion during transit should be excessive in the presence of low 
concentrations of AAT. Completion of the pathophysiological 
axis required a chemoattractant signal and cell activation, 
adhesion, and migration, as summarized in Figure 3. The 
figure also summarizes the relevance of fluid sampling related 
to the parts of the process being studied. 

It was strongly believed that monitoring the destruc- 
tive process would identify a biomarker that would be 
increased with progression and decreased by effective 
therapy. Assays were developed to detect elastin-break- 
down products/peptides and the cross-linking peptides, 
desmosine and isodesmosine. Lung elastin is a very long- 
lived connective tissue, 32 and hence, breakdown products 
should not be a feature of health. Unfortunately elastin 
is not confined to the lung, and these elastin-breakdown 
products, despite being elevated in patients with neutro- 
philic lung diseases, do not help with the management of 
even AATD. There is some evidence that these elastin- 
breakdown products increase with disease progression 33 
but do not mark any therapeutic benefit. 34 Furthermore a 
variety of other inflammatory lung diseases and variable 
clinical states also affect the levels of those markers, 35 and 
for many reasons, their validity remains controversial. It 
is possible that only specific elastase-generated fragments 
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Figure 3 The pathological process involved in emphysema. (I) cigarette smoke activates macrophages (2), epithelial cells (3) and airway neutrophils (4) to release pro 
inflammatory cytokines and neutrophil chemoattractants. At the same time oxidant stress in smoke damages local airway proteins (3). Harvesting airway secretions (A) 
detects markers of these effects including the influence of airway colonisation (5) and local mucus over production. The chemokines activate endothelial cells and circulating 
neutrophils (6) leading to adhesion and migration. Blood biomarkers (C) reflect these events. Migrating neutrophils ± local activated macrophages destroy connective tissue 
releasing specific fragments into the lymph and together with locally produced chemokines circulate into the circulation where they can be detected (B). 
Abbreviations: LTB4, leukotriene B4; IL8, interleukin 8. 



would be useful and hence results may depend on both 
enzyme-cleavage preparation of antigen and monoclonal 
antibody. 36 More recently, renewed interest in the role of 
desmosine (an elastin cross-linking peptide) has occurred, 
reviewed extensively by Luisetti et al 34 and shown to relate 
to lung physiology. 37 The relationship with spirometry was 
weak but that to gas transfer was stronger, consistent with 
elastin breakdown being more a feature of emphysema. 
However, studies of elastin breakdown may be more rel- 
evant if assessed in airways secretions. 38 So despite the 
studies that evaluated lung elastin-breakdown products as 
far back as the 1970s and strong evidence that this is a key 
process in the development of emphysema, its validation 
as a biomarker remains an unresolved problem. 34 

Elastase and other proteinases 

The release of elastase from activated neutrophils within 
the lung tissues is necessary to cause the proteolytic dam- 
age implicated, at least, in emphysema. This would seem, 
therefore, to be a much more relevant and direct biomarker 
of this clinical/pathological phenotype. The problem is that 
although elastase release in the lung interstitium is likely to 
be critical for matrix degradation at this site, the enzyme 
is rapidly inactivated (even in AATD), especially by local 



inhibitors, even if already bound to elastin. 39 An alternative 
approach has been to measure the enzyme activity in the 
lung secretions. Sputum elastase activity is detectable in 
usual COPD but only during neutrophilic exacerbations, 40 
again reflecting the capacity of the local inhibitors. However, 
although elastase activity is more easily detectable in AATD 
patients, even in the stable state, 41 it is also present in the 
secretions of bronchiectasis patients with 42 and without 43 
cystic fibrosis. Since these latter diseases do not develop 
emphysema, the data suggest neutrophil recruitment more 
locally (bronchial circulation), sparing the interstitium, in 
response to bacterial load, 44 and overwhelming local- and 
serum-derived inhibitors play a role, potentially damaging 
the airway epithelium and local host defenses. 45 

Indeed, other enzymes are often detected in these same 
secretions, including proteinase 3, 46 metalloproteinases, 47 
and cysteine proteinases, 48 all of which have been implicated 
in emphysema but which also form part of an independent 
proteinase cascade. 49 Whatever the final pathway, all these 
enzymes and their inhibitors may become both specific and 
nonspecific biomarkers. Thus, whether these markers provide 
insight into airways disease above and beyond simple associa- 
tions or merely reflect sputum purulence (see Exacerbations 
section) remains to be determined. 
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For the emphysema process, bronchoalveolar lavage 
(BAL) would be the only viable option for the study of 
markers in the airway Although this approach has been 
used for more general inflammation markers (see General 
inflammation markers section), it has added little to the 
data because of the difficulty in obtaining a true alveolar 
sampling. However, in AATD, lavage, AAT, and elastase 
inhibitory capacity have been used to support the benefit of 
AAT augmentation therapy, 50 which has also been reflected 
in sputum studies. 51 

More recently, an alternative approach to the detection of 
lung elastase activity has been developed and at least partly 
validated. As enzyme is released from neutrophils, there is a 
finite area and time within which the enzyme remains active. 
Specific cleavage of the surrounding substrate can produce 
a biochemical footprint of enzyme activity. A specific cleav- 
age product of fibrinogen (Aa-Val360) has been shown to 
relate to lung physiology, be reproducible, increase during 
exacerbations, and to decrease with augmentation therapy, 
in AATD. 52 Concentrations were also found to be higher than 
normal in usual COPD and to be related to physiological and 
radiological measure of emphysema as well as radiological 
progression of emphysema in the lower zone; 53 this suggests 
a similar elastolytic process is at least partly responsible 
even in the presence of normal AAT. These latter findings 
may well relate to the abnormal neutrophil function seen in 
usual COPD (see The neutrophil section). Whether studies 
of similar markers of other elastolytic enzymes will also 
be informative remains to be seen. However, of interest, it 
should be noted that as part of the proteinase cascade, neu- 
trophil elastase (NE) is responsible for the activation of the 
cysteine proteinase cathepsin B. 54 Although this enzyme is 
not inhibited by AAT, the inhalation of AAT, which would 
inactivate NE, was also shown to result in the inactivation 
of cathepsin B, 55 suggesting the latter is also a downstream 
biomarker of local NE activity. In the same vein NE activates 
matrix metalloproteinases (MMPs), 56 and these enzymes are 
responsible for tumor necrosis factor (TNF)a processing, 57 
which, in its own right, drives neutrophilic inflammation. 
Thus, understanding these parts of the inflammatory cascade 
provides, not only potential therapeutic targets but also, 
potential biomarker readouts of efficacy (Figure 4). 

The neutrophil 

Since the identification of NE as a potential mediator of 
emphysema, studies of this cell have become central to our 
understanding of the end point of tissue destruction in this 
subtype of COPD. There is extensive literature documenting 




I CT destruction 
Emphysema 

Figure 4 Interrelation of proteinases and TNF. Cigarette smoking leads to TNF 
release and sequential events leading to emphysema (I). Serine proteinases released 
by recruited neutrophils activate MMPI2 and inactivate its' cognate inhibitor/s (2). 
MMPI2 inactivates alAT facilitating its own activation by serine proteinases (3). 
MMPI2 then leads to extracellular processing of the interaction of TNF with its 
receptor (4) facilitating the main pathway (I). 

Abbreviations: al AT, alpha I -antitrypsin; CT, computed tomography; MMPI2, 
matrix metallopeptidase 12; TIMP, tissue inhibitor of metalloproteinase; TNFa, 
tumor necrosis factor a. 

evidence of the activation of neutrophils and their adhesion 
molecules in COPD. However, few of these studies have 
determined whether this is COPD-specific or reflects lung 
neutrophilic disease states in general. However, airway 
neutrophils have been shown to relate to the decline in 
lung function 58 and emphysema quantified by CT scan. 59 
Nevertheless, the findings of such studies (as with all 
inflammatory mediators) beg the question of whether the 
observations are physiological, caused by the "severity" of 
the disease, or pathological, causing the severity of the dis- 
ease itself. The data implicating the neutrophil is extensive 
but remains circumstantial. Although it is recognized that 
the degree of emphysema relates to the amount of NE in the 
lung 60 and that NE destroys elastin, for this to be pathologi- 
cal would require that there be an abnormal response but 
not in those not demonstrating the phenotype (ie, healthy 
age-matched smokers). AATD provides another control since 
the pathological mechanism (low AAT) is well established, 
and disease severity (to obviate any physiological response) 
and treatment (to overcome any influences of drugs) can be 
matched to patients with usual COPD. 
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It was recognized in 1987 that the polymorphonuclear 
leukocytes (PMN) from emphysema patients with COPD 
behaved abnormally. The cells demonstrated a rapid motility 
and an excessive ability to destroy connective tissue. 61 
This was related to increased endothelial adhesion under- 
flow 62 and reflects a defective chemotactic response to routine 
chemoattractants. 63 These features are not found in healthy 
smokers and matched patients with AATD. The defect has 
been linked to the abnormal expression of phosphoinositide 
3 (PI3) kinase and may reflect excessive production by phos- 
phatidylinositol (3,4,5)-triphosphate (PIP 3 ), a genetic defect 
of regulation, or defective breakdown by the phosphates, 
phosphatase and tensin homolog (PTEN) and SH2 domain- 
containing inositol 5'-phosphatase 1 (SHIP1). Thus further 
studies are required to determine the exact mechanism, but it 
does reflect a pathologically relevant trait that needs further 
validation, including determining whether this is a feature of 
susceptible smokers alone and relates to the rate of progression. 
Nevertheless, recent data has shown that activated neutrophils 
in usual COPD colocalize to the area of the lung where the 
emphysema develops. 59 The signal is also clearly different to 
healthy controls and AATD patients and provides a platform 
for early Phase II studies of neutrophil-modifying therapies. 

This whole pathophysiological hypothesis of the role 
of the neutrophil in the development and progression of 
disease (at least of emphysema) has therefore a long and 
well-established history based on a human genetic model. 
With all this knowledge, the potential for biomarker discov- 
ery would seem relatively straightforward, and yet, no such 
validated biomarker that fulfils all the necessary require- 
ments (marks the disease process, reflects the activity of the 
disease process, is stable, predicts progression, and identifies 
patients amenable to therapy) has emerged. With this lesson 
as an example, all other markers need similar understand- 
ing and background before true advances in specific disease 
management can occur. 

The eosinophil 

The eosinophil is more classically associated with asthma. 
However, one study in COPD has shown that an increase in 
the proportion of eosinophils in induced (or spontaneous) 
sputum is indicative of a good spirometric response to inhaled 
corticosteroids. 64 Furthermore a recent publication has also 
indicated that this marker (reflected in blood eosinophils) 
also identifies patients who are likely to benefit from oral 
steroid therapy for exacerbations. 65 Importantly, the study 
also suggested that the noneosinophilic group faired less 
well with steroid therapy. These studies would be consistent 



with an asthma/COPD overlap syndrome and provide guid- 
ance for a more steroid-orientated treatment strategy. Hence, 
eosinophils can be seen to fulfill one of the requirements for 
a biomarker. Although bronchodilator reversibility, per se, 
influences spirometric decline, 66 it remains unknown whether 
it is the eosinophilic subset that marks the susceptible, yet 
steroid-responsive patient or predicts future progression. 

Oxidative stress 

The other key process implicated in the pathophysiology of 
COPD is oxidative stress. Cigarette smoke itself contains high 
concentrations of oxygen-free radicals with the potential to 
damage tissues and proteins, and stimulate inflammation. 
Despite an efficient antioxidant defense, there has been clear 
and repeated evidence of oxidant stress in COPD. Again, 
whether this is pathological or physiological (a general 
response-to-smoking) remains unclear. This topic was exten- 
sively reviewed in 2008 67 and more recently in 20 13, 68 and 
the oxidant and proteinase story has been brought together 
in a further recent review. 69 However, to date, the markers of 
oxidative stress have not been widely used other than to reflect 
the process itself. Validation of these markers and their use in 
supporting effective antioxidant therapy is awaited. 

Genetic studies have, however, cast some light on the 
potential pathological, rather than physiological, nature of 
the ability to deal with oxidative stress (see Technology 
platforms section). 

Inflammation markers 

Although the preceding sections addressed relatively specific 
marker studies in COPD that were based on clinical and 
pathophysiological processes, the rapidly expanding literature 
on biomarkers has taken a more generic stance. It is widely 
accepted that the majority of diseases in the COPD syndrome 
have inflammation as the key underlying mechanism, and 
studies of a vast array of mediators have been undertaken, 
addressing three distinct questions. 

1 . Is it/are they elevated in COPD? 

2. Does it/do they relate to the physiological markers 
(usually spirometric) of severity? 

3. Does it/do they reflect a specific patient phenotype or a 
therapeutic target? 

Using the search terms "biomarkers and COPD", in 
excess of 1,000 papers can be found that have been pub- 
lished since 2008. It is clearly beyond the scope of the 
current review to discuss such a large body of literature, so 
selective aspects have been chosen to illustrate the general 
principles required for interpretation (as few have emerged 
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to answer most of the questions outlined in the introduction). 
C-reactive protein (CRP) is probably the most nonspecific 
marker of inflammation and yet, has been widely studied 
in COPD. At present, over 300 articles can be identified 
via PubMed using the search words of "CRP and COPD". 
CRP levels, even those within the accepted normal range, 
predict future cardiac events in a general population, 70 and 
they are implicated in the pathophysiology of vascular dis- 
ease. 1 There is an inverse relationship between plasma CRP 
levels and lung function, even in subjects assumed to be 
healthy. 71 The levels are elevated in COPD patients without 
evidence of ischemic heart disease; these are stable and are 
reduced with inhaled corticosteroids, 72 although this does 
not reflect a disease-modifying effect. 

There has been no clear mechanism to implicate CRP 
in the pathophysiology of COPD, nor does it reflect known 
genetic polymorphisms, suggesting it is merely a general 
marker of the underlying inflammatory process associated 
with COPD. 73 However, the data did suggest that high CRP 
was associated with an increased odds ratio for hospital- 
ization, although probably reflecting disease severity. The 
conclusions were based on a large population cohort and 
as such, only indicate trends and will have little impact on 
managing individual patients. 73 

CRP levels rise during exacerbations particularly when 
there is an increased neutrophilic influx due to a bacterial 
cause. 74 In addition, a raised CRP in the stable state predicts 
recurrent exacerbations due either to a failure to completely 
resolve the first episode or a persistent underlying airway 
colonization that predisposes to further episodes. 75 

Fibrinogen has also been widely studied in COPD and has 
been well covered in a recent review. 76 Fibrinogen is also an 
acute -phase protein and is raised in COPD (as part of the sys- 
temic effects of the disease), increases during exacerbations, 
and is higher in those with frequent exacerbations and those 
with more severe airflow obstruction. It may also play a role 
in comorbidities, especially vascular disease and potentially 
cardiovascular death due to an increased clotting tendency. 
Again, no genetic polymorphisms have been identified indi- 
cating fibrinogen is also more likely to be a reflective feature 
of COPD. Although studies indicate that therapy can reduce 
fibrinogen levels, it remains unknown whether this affects 
progression or other outcomes. If so, fibrinogen could act as 
an ideal biomarker at least for some aspects of COPD, and its 
management and further studies are awaited. The references 
related to these data are included in the review article. 76 

Many other inflammatory markers have been measured 
in COPD, and some seem to be persistently raised. The non- 
specific nature of these changes provides little insight into 



management or pathophysiology. Surfactant protein D (SPD) 
is a lung-derived protein. SPD is elevated in the plasma in 
COPD and falls with corticosteroid therapy, 77 suggesting 
that steroids reduce systemic leak from the lungs as part of 
a general anti-inflammatory effect. However, in one study, 
the changes in SPD following inhaled therapy were also 
reflected in the degree of improvement of health status and 
FEV in the short term, 78 suggesting that lung inflammation 
itself affects these features. Of interest, the CRP and inter- 
leukin (IL)-6 did not change, suggesting that the effects were 
acting locally in the lung and not influencing other aspects 
of generic systemic inflammation (although the SPD effect 
could be measured systemically). 

Studies are beginning to address the requirements of an 
ideal biomarker, with therapeutic interventions for Phase II 
and Phase III studies in mind. For example, fibrinogen was 
used as a marker in a Phase II study of p38 mitogen-activated 
protein kinase (MAPK) inhibitors, 79 which effectively block 
the proinflammatory pathway. The treatment clearly had an 
effect, as determined by reducing fibrinogen levels. How- 
ever, whether this pathway is merely physiological, rather 
than pathological, remains unknown, and hence, fibrinogen 
still remains uncertain as a therapeutic marker option in the 
long-term or short-term management of COPD. Neverthe- 
less, this study represents a first step toward the development 
and assessment of new therapies using a biomarker, perhaps 
with a cardiovascular benefit (in those with this comorbidity), 
rather than a pulmonary one. 

Other inflammatory markers are measurable and elevated 
in the plasma, and although none have proven to be specific, 
an alternative has been to measure several to develop a 
multicomponent approach. 80 Using the Evaluation of COPD 
Longitudinally to Identify Predictive Surrogate Endpoints 
(ECLIPSE) cohort, 80 the authors measured several blood 
biomarkers that were increased in COPD patients compared 
with healthy smokers and nonsmokers. The white blood cell 
(WBC) count, CRP, and fibrinogen levels fitted this criterion, 
as did IL-6, IL-8, andTNFoc. The proportion outside the 95th 
percentile of nonsmokers was networked at baseline and at 
1-year follow-up to determine consistency. A subgroup (16% 
of the patients) with persistently elevated inflammatory mark- 
ers (two or more) was identified who had a higher incidence 
of exacerbations (cause or effect?) and worse survival. 

This approach of selecting certain markers or traits to 
identify patient groups with common features is the basis 
for "cluster analysis", which may be informative but clearly 
depends on the traits or factors selected and always needs 
validation in a second patient cohort; such studies are 
awaited. 
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The ECLIPSE cohort has been used to study a large 
number of other potential biomarkers. In 2011 Dickens 
et al reported on the stability of 34 blood biomarkers over 
3 months. 81 The importance of stability is that single measure- 
ments could be informative of prognosis and particularly, may 
reflect the underlying disease activity that leads to progres- 
sive deterioration. Fifteen of the markers were different in 
COPD compared with the controls, and four reflected patients 
who had exacerbations between baseline and follow-up 
(CRP, fibrinogen, IL-6, and SPD), but few were classified 
as stable and hence useful as markers of baseline disease 
activity (fibrinogen, Clara cell protein [CC]16, and SPD). 
Although there were weak correlations of the other markers 
with spirometry, fibrinogen, which did not correlate, emerged 
as the most promising as it was stable and was a marker of a 
frequent exacerbation phenotype, exercise impairment, the 
severity of the BODE index, and dyspnea. 

However, of these "stable" markers, only CC16 showed 
both a relationship to baseline spirometry and the subse- 
quent annual decline. 82 At present, only this protein and 
the Aa-Val360 (see Elastase and other proteinases section) 
biomarker have shown such characteristics. CC16 is an 
immunosuppressant and antioxidant produced in the airways. 
The fact that it is elevated in COPD suggests it is reflective 
of the disease, rather than causative; however, it is possible 
that its measurement in serum (like SPD) reflects leak- 
age from the lung due to local inflammation, which may 
be the generic process associated with, rather than causing, 
decline. Nevertheless, this series of studies demonstrates 
several of the steps needed to validate a biomarker, which 
if subsequently implicated in the pathophysiology, will also 
direct thinking to new specific therapeutic targets. 

SPD levels in blood are also raised in COPD and respond 
to steroid therapy as indicated above. Since low SPD levels 
in the lung have been implicated in the pathophysiology 
of COPD, 83 are reduced in BAL in COPD, 84 and relate to 
spirometry, 85 it fulfils more of the pathophysiological role 
than does CC16. However, although SPD responds to oral 
steroids with a decrease in plasma levels, it is unknown what 
happens in the lung in the same patients. Nevertheless, in a 
separate study, COPD was associated with low lung levels of 
SPD, and inhaled steroid therapy was associated with higher 
levels, 84 suggesting that SPD is a good marker of disease 
and response to treatment. Subsequently, a polymorphism 
of the SPD gene was identified that also relates to COPD 86 
and that is reflected in the level of SPD - at least in the 
serum (transcription and lung levels are currently unknown). 
This latter point is of importance: whereas this appears 
to "close the loop", it should be noted that a functional 



polymorphism of SPD that is of pathophysiological impor- 
tance should result in low lung levels of SPD, but the presence 
of airway inflammation and leak may, counterintuitively, lead 
to high blood levels. Nevertheless, the studies do suggest the 
potential for local therapy with SPD in COPD and that it is 
certainly worth pursuing. 

Sampling issues 

Studies of the pathophysiology of COPD are made all the 
more complex because of the different pathological pheno- 
types and the fact that the site of damage/repair is in the local 
tissues. Clearly this is not an issue if the biomarker is either 
genetic (as in AATD), or systemically reflects the disease 
process or specifically samples the disease area. To date, 
biomarkers have been studied in exhaled breath condensate, 
spontaneous or induced sputum, BAL, or serum/plasma. All 
of these have inherent problems in interpretation, especially 
when using high-throughput methodology. 

Firstly, many commercially available assays have not 
been validated using the biological medium collected 
and processed for the measurement (especially airway 
secretions) and can produce inaccurate results. 87 Secondly, 
the mediator measurement (especially in exhaled breath 
condensate) may be detectable but often below the lower 
limit of quantification. 88 Thirdly, the biological medium may 
not specifically sample the area where the disease process 
is occurring. For instance, sputum or induced sputum is 
unlikely to represent samples derived from the small airways 
or alveoli, and BAL does not exclusively sample the small 
airways or alveolar region. In addition, the variability between 
samples from the lung is often wider than the differences 
between subjects or clinical states. For instance, markers in 
spontaneous sputum may have a day to day variability of up 
to 200%, for many reasons. 89 Obtaining samples sequentially 
and averaging the results drastically reduces this variability, if 
the patient is stable; 89 however, this approach cannot easily be 
applied to invasive procedures, such as BAL or induced spu- 
tum, since the introduction of saline or hypertonic saline itself 
is proinflammatory, and several days are needed between 
sampling to enable a return to the stable state. However, 
serum or plasma stability is much more reliable, at least for 
some biomarkers. 81 Pathological specimens provide an oppor- 
tunity to sample individual cells and their nature/metabolic 
activity, by laser capture methodology, but other sampling 
techniques provide varying information from normal and 
abnormal tissue and may not reflect the pathological area 
unless performed on lung resection material within defined 
pathological areas. Finally, mediators measured in blood 
samples may reflect processes occurring in the interstitium, 
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recirculated from lymph but not necessarily the airways. The 
subsequent volume of distribution, not only dilutes mediators 
but also, clears them by appropriate receptor binding. 90 Even 
then, studies like proteomics and metabolomics will differ in 
plasma compared with serum, due to coagulation and also, 
by sample collection, distribution, processing, and storage. 
Careful matching with appropriate controls and interpreta- 
tion of data is therefore essential in all studies but is rarely 
done or reported, making the comparison between studies 
relatively tenuous. 

Exacerbations 

Exacerbations are an important contributor to health status, 
health care utilization, and death, particularly if leading to 
hospitalization. The definition of this symptom-led event, its 
complexity, and the broad approach to treatment and preven- 
tion has generated a search for an objective measure of an 
exacerbation and therapeutic guidance. 

In general, exacerbations are events that are associ- 
ated with an increase in inflammation, both in the lung 
and systematically. 91 However, in general, the events may 
reflect a bacterial or viral cause or be related to other causes 
of increased airway obstruction or may even reflect the 
destabilization of other organ systems, such as in associated 
cardiovascular decompensation. 92 The main decision required 
for such events is whether to increase bronchodilator therapy 
alone, to treat with corticosteroids or antibiotics, or both. 
Of these, the use of antibiotics has received most attention 
because overuse is likely to lead to the continued develop- 
ment of bacterial resistance. 

Serum procalcitonin is normally produced by neuro- 
endocrine cells of the thyroid and lung but is also produced 
elsewhere in the body, in response to bacterial infection. For 
this reason, it has been used to predict bacterial infections 
and outcome in patients with sepsis. 93 However in general 
this approach has been unsuccessful in COPD. 9495 Serum 
amyloid A has also been suggested for this as it is increased 
during exacerbations with a viral or bacteriological aetiol- 
ogy; 96 however, it failed to differentiate them, despite reflect- 
ing severity of the episode, and hence would not facilitate 
treatment guidance. 

To date, the best two markers that influence therapy are 
symptom deterioration with known or current eosinophilia, 
which likely benefit from steroid therapy, 65 or sputum color. 
The latter reflects the presence or absence of a large neutro- 
philic load, imparting various shades of green. A change in 
color reflects an increase or decrease in sputum neutrophilia. 
The color can be characterized at least semiobjectively by 



both the patient and health care worker, using a standard 
color chart. In the initial study using this approach in pri- 
mary care, there was high sensitivity to bacteria isolated 
from sputum and in particular, an increase relating to the 
colonizing microbial load, even in the stable clinical state. 44 
This approach, using sputum color, enabled antibiotics to be 
withheld without detriment from patients with white or color- 
less sputum 74 and also enabled monitoring of the antibiotic 
response by the patients. 97 Subsequent studies have confirmed 
the robust nature of this approach in identifying patients 
with significant colonization 98 and also, that withholding 
antibiotics from those without yellow or green sputum had 
no adverse effects. 94 Interestingly the authors found that CRP 
and not procalcitonin was increased in patients with purulent 
sputum, suggesting that CRP may be a better discriminator 
for bacterial infection. Clearly, further clinical trials are 
indicated to consolidate this approach. 

Technology platforms 
and the "omics" revolution 

In recent years, the ability to obtain multiple assay kits com- 
mercially and the development of the proteomics, metabolo- 
mics, transcriptomics, and genomics approaches to patient 
assessment has led to an overwhelming plethora of data. This 
does lead to some technical issues: the assay may not have 
been validated for the biological sample or the processing 
methodology used may lead to spurious and unrecognized 
effects on the result, both within and between patients. In 
addition, the data generated has a sensitivity that far out- 
weighs our current ability to phenotype patients sufficiently 
from a clinical, physiological, and radiological viewpoint. 
It may be that this approach will lead to a pathophysiological 
classification of COPD that negates clinical phenotyping, 
although this remains to be determined and would have to be 
validated against the clinical "gold standard". 

Proteomics (with 75 citations in the past year) is based on 
the identification of COPD proteins in biological fluids by a 
multitude of techniques, which produce a two-dimensional 
spectrum of data (Figure 5) to determine the similarities or 
differences between subjects. This demonstrates the amount 
of data that can be generated by the "omics" approach. The 
benefit is that this is not a "hypothesis-led" approach, and 
new data can emerge as "hypothesis generating". Neverthe- 
less, such an approach can lead to false-positive and false- 
negative results, and new statistical approaches to the data 
are needed. 99 

Indeed it has even been suggested that combining the data 
from different "omics" platforms, as part of an integrated 
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system biological approach to understanding lung diseases, 
may be necessary to make progress. 100 The methodology 
is advancing at great pace. The review by Nicholas et al 101 
defines the principles and aims of the methodology and is 
worth reading, for those interested. 

Less has been published on metabolomics, but in a small 
study of smokers with emphysema, the assessment of 3,534 
metabolite markers was able to separate smokers with and 
without emphysema, 102 and the top 12 were used for predic- 
tive models, some of which produced clear separation (as 
good as CT scan?). 

Genetic studies have dominated the COPD world since 
the early 2000s, with both a candidate gene approach and 
genome-wide association studies. The data have aimed at 
detecting the genetic markers of COPD predisposition, 
thereby leading to new therapies based on gene function 
(over- or underactive) or a preventative approach to the 
disease(s). Many of the genetic markers failed to identify 
polymorphisms that influence the targeted gene function, 
but associations with COPD or its features suggested the 
marker was in linkage disequilibrium, probably with a nearby 
functional abnormality. Such susceptibility markers known 
to be associated with functional abnormalities include matrix 



metallopeptidase 9 (MMP9) and apical emphysema, 103 TNFa 
and bronchitis, 104 superoxide dismutase and emphysema, 105 
iron-responsive element binding protein 2 (IREB2) and 
COPD, 106 andIL-13 promoter polymorphisms and an interac- 
tion with cigarette smoking on FEVj. 107 Nevertheless, such 
associations represent only minor contributors to the COPD 
complex that could reflect either the superficial phenotyping 
of patients or indeed the complexity of the disease(s). 108 

Such studies do reflect potential susceptibility genes and 
hence, pathophysiological pathways, but how or whether they 
reflect the rate of progression or long-term outcomes has yet 
to be determined. Nevertheless, in other diseases, treatments 
have been influenced by genetic studies, suggesting the same 
may follow for COPD, at least for those associated with a 
functional contribution. 109 It may be that a panel of genes or 
a panel of gene transcripts may provide further insight into 
progression or the response to treatment, although far from 
personalized medicine. 110 Interestingly, a recent study 111 has 
suggested that a polymorphism in the a disintegrin and met- 
alloprotease domain 33 (ADAM33) gene is associated with 
all-cause and cardiovascular mortality in COPD. The group 
of proteins encoded by this gene domain play important roles 
in cell adhesion, migration, and proteolysis, and ADAM33 
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Table I Biomarkers in context 



Sample 


Measures 


Context 


Problems 


Benefit 


Plasma/ 


Cytokines/peptides/products 


Systemic inflammation 


Coagulation may activate cells/ 


Easy to obtain 


serum 


Acute phase proteins 


Systemic response 


proteins/peptides 


Repeated sampling acceptable 




Cell activation markers 


Inflammatory cells 


Distant from the lung 


No dilutional problems 




and secreted/shed products 


Tissue damage 


May not be lung specific 






Tissue damage products 


Lung inflammation 


Clearance of cytokines by receptor 






Lung leakage 




binding 




Exhaled 


Cytokines 


Airways/ 


Many measures are below the lower 


Repeated sampling 


breath 


Markers of oxidant stress 


oropharyngeal 


limit of quantification/detection 




condensate 


pH as a marker 
of inflammation 


inflammation 


Site not localized 

Variable dilution by condensate 




Lung lavage 


Cytokines/peptides 


More localized 


Lavage of many regions 


More direct lung sampling 




Markers of oxidant stress 


to the lung 


Bronchial contamination 


Can be regionally targeted 




Cells number and function 




Variable dilution by instillate 






Protein leakage from plasma 




Not readily repeatable 






Damage markers 




Proinflammatory 




Sputum/ 


Cytokines/peptides/proteins 


More localized 


Samples bronchial secretions 


Spontaneous sputum is readily 


induced 


Markers of oxidant stress 


to the lung 


Oropharyngeal contamination 


repeatable 


sputum 


Cell number and function 




Variable dilution 


Variability can be reduced by 




Damage markers 




Induced is proinflammatory 


sequential sampling 


DNA 


Variation in genetic 


Underlying 


Often nonfunctional 


Not subject to collection or disease 




sequence 


susceptibility and 
pathophysiology 


May be indirect, reflecting near- 
by genetic abnormalities 


state influences 


Physiology 


End-organ damage 


Assesses the change 


Does not determine the degree, 


Reasonably well understood 






in lung function 


site, or pathological damage 


Correlates with health status, 






as a result of damage 


with precision 


activity, and mortality 


CT scan 


Lung density 


Localizes abnormalities 


Radiation exposure 


Identifies pathology 




Changes in architecture 


that influence 
clinical outcome 


Repeat measures limited 


Predicts mortality 

Sensitive to emphysema progression 


Biopsy 


Cells 


Lung tissue 


Variability between samples 


It is the most direct way of studying 




Architecture 




Not easy to repeat 


the process in situ 




Cytokines 




Only samples a small portion 






Damage products 




of the lung 





Abbreviations: CT, computed tomography; DNA, deoxyribonucleic acid. 



has also been implicated in the rapid decline in lung func- 
tion and development of COPD. 112113 Whether this reflects a 
functional abnormality and hence, a treatment target needs 
to be determined. 

Summary 

Biomarker discovery has become a rapidly expanding field 
of research primarily to develop easy and early readouts 
for delivery of Phase II clinical studies and a shortening of 
pivotal Phase III studies that displace conventional markers 
like FEV r A variety of methodologies and sample collec- 
tions have been undertaken, measuring things as diverse as 
the mechanics of the lung, patient symptomatology, markers 
of exposure (such as inhaled oxidants from cigarette smoke), 
markers of tissue damage (connective tissue degradation 
products), number and function of implicated cells, general- 
ized inflammatory markers, cell-activation products, under- 
lying proteins (their function, metabolism, and source), and 



the genetic background of the patients. These have involved 
a variety of biological samples collected in different ways, 
all with both benefits and drawbacks and all requiring cau- 
tion in interpretation, as summarized in Table 1 . However, 
the identification and understanding of such markers, either 
in a hypothesis-testing or hypothesis-generating approach, 
may prove sufficiently informative for understanding the 
disease process and for developing new therapeutic strategies. 
Nevertheless, without accurate patient characterization, bio- 
marker validation, and an understanding of the issues that 
likely influence the measurements (including current therapy 
and dissecting cause and effect), this process may prove to 
be both fruitless and confusing. 
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